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1 Approximations to the twin prime conjecture 
before 2000 


In the present work we will give a survey about some new results concerning small 
gaps between consecutive primes. In the following let ? denote the set of the primes, 
Pn the n prime and d, the n difference between consecutive primes, that is, 


d, = Pami — Pn (1.1) 
The famous Twin Prime Conjecture, 
d, =2 infinitely often, (1.2) 


would answer the most important problem about the smallest values of d,, which oc- 
curs infinitely often. However, we will investigate also various conjectures and results 
which do not follow from (1.2). 
We will also discuss the well-known generalization of (1.1), formulated in 1849 by 
de Polignac [18]: 
d, = 2k infinitely often for any k € Z*. (1.3) 


We will call an even number 2k a Polignac number if d,, = 2k infinitely often. 
Polignac’s conjecture (1.2) can be formulated with this definition as 


Polignac Conjecture. Every even number is a Polignac number. 


The most important weaker form of the Twin Prime Conjecture and the Polignac 
conjecture is the 
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Bounded Gap Conjecture. We have liminf d,, < 00, or, equivalently, there is at least one 
Polignac number. 


If we are looking for small gaps between consecutive primes, then the starting 
point is that the Prime Number Theorem, 


dt 
a(x) ~ ~ | —, (1.4) 
log x 7 logt 
implies (in fact, it is even equivalent to) 
1d 
lim — tosi, 1.5 
Nos N 2 logn (5) 
Therefore we have 3 
A, = lim inf — <1. (1.6) 
n= logn 
On the other hand, any estimate of type 
A, <1 (1.7) 


is already non-trivial. 

Equation (1.7) was proved first under the assumption of the Generalized Riemann 
Hypothesis (GRH) in an unpublished work of Hardy and Littlewood in 1926 (see [19]) 
providing the conditional estimate 


A < z under GRH. (1.8) 


The first unconditional estimate was shown only in 1940 by Paul Erdős [5], who 
proved 
A sa<l, (1.9) 


with an effective but unspecified value ç < 1. After various refinements of (1.9), a large 
step was done by the unconditional estimate 


A, < 0.467 (1.10) 


of Bombieri and Davenport [2] in 1966 based on the large sieve of Bombieri [1]. After 
several smaller sharpenings, Helmut Maier improved (1.10) in 1988 to 


A, < 0.2486 (1.11) 


using his famous matrix method [16]. 
The mentioned results before 2000 left open the question whether A, = 0, which 
can be formulated as the 


Small Gap Conjecture. A, = 0. 
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2 Distribution of values of d,,/ log n 


The Small Gap Conjecture is equivalent to the fact that the number zero belongs to the 
set of limit points of the sequence {d,,/ log n}2,. In the following, let J denote this set, 


i.e. 
d I 
J= F n } y; (2.1) 
ogn 


Erdős [6] and independently Ricci [20] proved that J has positive Lebesgue mea- 
sure and Erdős formulated the conjecture that every nonnegative number is a limit 
point of the sequence {d,,/ logn}, that is, 


J = [0, 00). (2.2) 


However, the paradoxical situation was that no finite limit point of J was known. (The 
result co € J was already known by the result of Westzynthius [23].) 

We would like to remark here that neither the Twin Prime Conjecture nor the Poli- 
gnac Conjecture would imply any limit point of the sequence d,,/ log n beyond the sin- 
gle number zero. 


3 Results between 2000 and 2010 


The Small Gap Conjecture, equivalently 0 € J, was shown finally in a joint work with 
D. A. Goldston and C. Y. Yıldırım in 2005. 


d 
Theorem A ([9], [10]). lim inf — = 0. 
n= logn 


Somewhat later we showed the above in a much stronger quantitative form as 


Theorem B ([11]). lim inf a < œ 
noo (log n)'/?(log log n)? 

The method of proof of Theorem A provided a surprisingly strong estimate con- 
cerning small values of d,,, the truth of the already mentioned Bounded Gap Conjec- 
ture, if we suppose that primes are sufficiently uniformly distributed in arithmetic pro- 
gressions. In order to formulate the result we need the following 


Definition. Primes have an admissible distribution level 9 if 


X X 
max| log p - — | «4. — (3.1) 
2 Gat 2 9(q) (log X)4 
p=a(mod q) 


for any constants A > 0, € > 0 and any X > 2. 
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Concerning known results about admissible distribution levels 9 for the primes, 
the strongest result is due to Bombieri [1] and A.I. Vinogradov [22], the celebrated 
Bombieri-Vinogradov Theorem. This asserts that the value 9 = 1/2 is an admissible 
distribution level for the primes. 

The strongest possible conjecture about the maximal distribution level of the 
primes was expressed by Elliott and Halberstam ((4]) right after the proof of the 
Bombieri—Vinogradov theorem. 


Elliott-Halberstam Conjecture (EH). 9 = 1 is an admissible distribution level for the 
primes. 


Our previously mentioned result can be formulated as follows: 


Theorem C ((10]). If there is an admissible level 9 > 1/2 of the distribution of primes, 
then 
lim inf d„ < C(9) < oo. (3.2) 


In particular, if the Elliott-Halberstam conjecture is true, then 
lim inf d,, < 16. (3.3) 
n> 
Theorem A implies for any e > 0 an infinitude of prime gaps satisfying 
d,, < logn. (3.4) 


However, it left open the interesting problem whether (3.4) occurs with a positive fre- 
quency for any given € > 0. Goldston and Yıldırım [13] showed that this is true if 
€ > 1/4. We have to mention that Maier’s method [16], leading to a slightly smaller 
gap of size (1.11), did not yield small gaps with a positive frequency (unlike the result 
(1.10) of Bombieri-Davenport [2], which in fact, did). 


4 Recent results 


The aim of the present section is to give a survey about the most recent results in this 
area. The first result is a sharpening of the hitherto best estimate of Theorem B. 


d 
Theorem 4.1. lim inf ———-—*———- < 0. 
n>09 (log n)3/7(log log n)*/7 
The next one is acommon generalization of Theorem C and the celebrated result 
of Green and Tao [14] according to which there are arbitrarily long (finite) arithmetic 
progressions in the sequence of primes. We need the same conditions as in Theorem C 
but the consequence is stronger. 
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Theorem 4.2 ([17]). For every 9 € (1/2, 1] there is an integer C(9) and a positive even 
number d < C(9) with the following property. If the primes have an admissible level 9 of 
distribution, then for any positive integer k we have a k-term arithmetic progression of 
primes { py, such that p; + d is the prime following p; for all elements of the progres- 
sion. If the Elliott-Halberstam conjecture is true, i.e. 9 = 1, or, at least 9 > 0.971, then 
the above is true withad < 16. 


As we mentioned earlier, the existence of an admissible level 9 > 1/2 for the distri- 
bution of primes implies the Bounded Gap Conjecture (cf. Theorem C), consequently 
the existence of at least one Polignac number. However, we can prove more under the 
same condition, namely 


Theorem 4.3 ([17]). If 9 > 1/2 is an admissible level of distribution of the primes, then 
Polignac numbers have a positive lower density at least c' (9). 


Thesame assumption 9 > 1/2 gives also further information about the limit points 
J of the sequence {d,,/ log n}°,. Thus, under the above deep hypothesis we obtain fur- 
ther progress in the problem of Erdős beyond the fact 0 € J (following from Theorem A 
unconditionally). 


Theorem 4.4. If 9 > 1/2 is an admissible level of distribution of the primes, then we 
have a number C, (9) (depending in an ineffective way on 9) such that 


[0,C,(9)] cJ. (4.1) 
Concerning the frequency of small gaps, i.e. the gaps satisfying 
d, < log p, (4.2) 


for a small but fixed € > 0, we could show in a joint work with D. Goldston and C. 
Yıldırım the following 


Theorem 4.5 ([12]). The sequence S, of indices satisfying (4.2) has a positive lower den- 
sity for any e > 0. 


5 Main ideas of the proofs of Theorems A, B, C 


The first step in proving Theorems A-C is to investigate the following far-reaching gen- 
eralization of the Twin Prime Conjecture. This was formulated more than a hundred 
years ago in a qualitative form by Dickson [3] and nearly two decades later (probably 
independently) in a quantitative form by Hardy and Littlewood [15]. In the formulation 
of the conjecture we shall call a k-tuple H = {hye of different nonnegative integers 
admissible if for any prime p J does not cover all residue classes mod p. 
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Conjecture DHL. If + is admissible, then there are infinitely many values n such that all 
n+ h; (i =1,2,...,k) are simultaneously primes. 


The next step is the formulation of a weaker form of the above DHL Conjecture 
(which is weaker than Polignac’s conjecture and it is equivalent to DHL in the simplest 
case k = 2). 


Conjecture DHL (k, 2). If H is an admissible k-tuple, then there are i, j € {1,2,...,k}such 
that for infinitely many values n we haven + h; € P, n + h; € P. 


We remark that the truth of the conjecture DHL (k, 2) for any particular k implies 
the Bounded Gap Conjecture. 

The strategy is to attack the above Conjecture DHL (k, 2) for some large values of 
k. We will miss it (thereby also the Bounded Gap Conjecture) just by a hairbreadth. 
However, this will help us to prove Theorem A and will immediately furnish a proof of 
the conditional Theorem C. 

Let us consider the product 


k 
Pac(n) = | [m+ h). (5.1) 
i=1 


If we try to attack the original very deep conjecture DHL for any given k € Z* it is 
plausible to use the weights of Selberg’s sieve, 


a= {a y ua (tog ®) F n € [N, 2N) (5.2) 


d|P y(n) 
d<R 


with some R < N!?, 
In fact, we can evaluate the sum of these weights for n € [N,2N) and obtain for 
R < N”? (log N)™ the asymptotic 


N 
Ay= È a~ zS log R)", (5.3) 
ne[N,2N) $ 


where v, = v (H) denotes the number of different residue classes covered by H mod p 


and 5 (30) 7 
z _ Yp ye 
S(H) := I] (1 z ) (: ) . (5.4) 


We note that G(H) > 0 if and only if H is admissible. 
Let us denote by yp(n) the characteristic function of primes, i.e. 


xp(n) =1lif ne ?, yp(n) =0 if n¢€ P. (5.5) 


Then we can investigate the frequency of those n ~ N for which n +h, € F, i.e. 
xp(n+h;) = 1 for any given h; € K, or more generally of those n for which n + h € P 
with some h ¢ H. 
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It turns out that in order to be able to evaluate the sum 


By(h):= È axp(n+h) (5.6) 


ne[N,2N) 


we need a much stronger condition than R < N'/?(logN)“, namely R < N®9? 
(which is only in the hypothetical case 9 = 1, i.e. in case of the truth of the Elliott- 
Halberstam Conjecture near to the bound N 1/ * (log N mcr which is the barrier for the 
evaluation of A, in (5.3)). 

If the evaluation of the quantities A y and By (h) is successful, we can investigate 
the average weighted number of primes among {n+ ht, if n runs through the interval 
[N, 2N). 

In case of h; € H we obtain 
2N (logR)*! 2Ay logR 


eee logN — k+1 logN’ en 


By(h,) ~ 


Hence, if we choose the maximal possible R, namely, we set 


R= NP, (5.8) 
then we obtain A 
By(h; 
By(h;) 9-€ 2 nhi) k (9-8) (5.9) 
An k+l? An k+1 ` 


which is unfortunately less than 1 even under the assumption of the Elliott-Halber- 
stam conjecture and unconditionally it is even less than 1/2. 

However, if we consider the weaker conjecture DHL (k, 2), we have no heuristic 
reason to consider only the k-dimensional Selberg’s sieve, which is a truncated version 
of 


1 P(n k 
Ax) E Ma (tog =) (5.10) 
k! dsm) d 
a(n. 
which approximates the situation 
n+h,eP, i=1,2,...,k. (5.11) 


This observation opens the door towards the introduction of other, higher dimen- 
sional sieves, or, equivalently, to the introduction of a larger class of possible weights 
depending on a new parameter £ > 0. Thus, let us consider the new weights in the 
form 


2 
$ u(d) log'** R) (5.12) 


d<R 
d|P5¢(n) 


a(n, R, Heks £) = ( = 


and the analogous quantities A y(£), By(h,€) instead of (5.3), (5.6), corresponding to 
the case £ = 0. 
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The new quantities can be evaluated too, if k and £ are fixed, N — oo and R = 
N®-9/2. (It is sufficient to suppose R < N®-®/? but larger values of R yield better 
results.) However, surprisingly enough, we obtain nearly twice as many primes in a 
k-tuple n + H on average if n runs through the interval [N, 2N) under the simple con- 
dition that £ and k/£ are both simultaneously large (for example £ = [vk] and k is 
large). We obtain, namely, in case of h; € H in place of (5.3), (5.7) and (5.9) 


2€ k+2€ 
N 2l +2 logR k+2€ 
Bulls) ~ eae JS) pea ER Hy) 


Consequently we obtain for any fixed e > 0 instead of (5.9) the better asymptotics 


By(h,t) 29-8) (1- 1 ). (5.15) 


An(é) k+2€+1 2€+2 

È By(h; £) 

i=l 2€+1 ( ) 

He Pe )(1- 29-2 4 
Ay() ae ə(1 am) leaga PRA G16) 


ifk > Ce’, € = [ Vk/2], for example. 

The result (5.16) immediately yields that choosing the weighted average implied 
by the weights in (5.12) we obtain on average for n € [N,2N) more than one prime 
among {n + hj, if we suppose the crucial condition 


9>1/2 (5.17) 


which implies Theorem C or more generally DHL (k, 2) for k > C,(9). 
However, in the unconditional case we have only 9 = 1/2 by the Bombieri—Vino- 
gradov theorem. Hence we obtain, say, more than 


1 - 4e (5.18) 


primes on average for n € [N,2N) among {n + h} so we fail by a hairbreadth to 
prove the Bounded Gap Conjecture (or Conjecture DHL (k, 2) for some large bounded 
values of k). 

On the other hand, we have still the possibility to count the weighted number of 
primes in a short interval 


he[i,H], H=nlogN, hg K, (5.19) 
where yis a small positive constant. An argument, similar to that yielding (5.13)—(5.14) 
gives in this case for any admissible k-tuple Hy 


An(é) G(H, Uh) 


By(h, £) ~ 
Kien lgN G(H;,) 


> (5.20) 
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consequently 
H 
È By, 8) 
h=1 ht Hy g -( 5 SP) i (5.21) 
An(£) mim, CG) /logN 


In the original work [10] a beautiful result of Gallagher [8] 


Hk 
$ S(H,) ~ — (kfix, H > 00) (5.22) 
Hy Cl HI Il=k k! 
was used along with an averaging over all possible k-tuples H, € [1, H] to obtain on 
average over n € [N,2N) further 7 primes among n + h with h € [1, H] \ Hy which 
finally led to more than one prime in some interval 


[n,n+ylogN] if 4 > 4e (5.23) 


with a suitable n € [N,2N), N > No(e). This proved Theorem A, the relation A, = 0. 
However, it might be interesting to note that Gallagher’s estimate can be avoided and 
one can show (even in a simpler way) for any fixed admissible 4, either the relation 


3 6(H, Uh) 


S(Hy, H) = SH) ~ 5 


h=1,h4 H; 
or in a simpler way the weaker inequality 
S(H,,H) > c*H for H > H)(k) (5.24) 


with a suitable absolute constant c* > 0, which is still enough to show Theorem A. We 
obtain a particularly simple proof if we show (5.24) just for some suitably chosen H, 
(for any large k), which is again sufficient for the proof of Theorem A. 

The proof of the quantitative Theorem B is based on the same ideas but it also re- 
quires many further ideas and is technically much more complicated. We just mention 
that in [11] we apply similar weights but the parameters k and £ tend to infinity with 


Nas 
4/1 
Z _ Vlog N > €= Vk. 
(log log N)? 


Furthermore, we have to replace Gallagher’s theorem (5.22) with an inequality 
(shown in a completely different way) which is still valid if 


(5.25) 


k < eH/ log H (5.26) 


(for which range (5.22) is probably no longer true). 

Additionally we have to replace Bombieri-Vinogradov’s theorem by a more gen- 
eral one which gives a non-trivial estimate also in the case when the moduli run only 
through a sparse sequence, namely through the multiples of a given modulus 


M < xtleslosx. (5.27) 
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6 Main ideas of the proofs of Theorems 4.1-4.5 


As we mentioned in the last section, the proof of Theorem 4.2 needed a choice of k, £ > 
co as N — ov (in the way given in (5.25)). This represented the essentially optimal 
choice of the parameters k, £ if we work with weights of type (5.12). It turned out already 
in [10] that for concrete small values of k (like for k = 6 in case of (EH), i.e. 9 = 1) it 
might be more advantageous to substitute the expression (log x)*° in (5.12) by a linear 
combination of similar terms with different values of £ (like £ = 0 and £ = 1 in case of 
k = 6, 9 = 1). This idea is crucial in obtaining Theorem 4.1, where a suitably chosen 
polynomial 
L, 
P(logx) = È by(logx)’, Li, L3 = L= k"? (6.1) 
L, 

is used in the definition of the weight corresponding to a(n, R, Hy, £) and k is chosen 


instead of (5.25) as 
k = (log N)?” (log log N)*”. (6.2) 


At first sight it may seem paradoxical that a linear combination of weights per- 
forms better than the best term itself. However, the new weight a), induced by P(log x) 
in (6.1), is itself not a linear combination of the earlier weights a,,(). In fact, the rela- 


tion between them is 
L, 2 
al = ( y ce \a,(€)) cp = blk +O)! (6.3) 
L, 


The proofs of the further Theorems 4.2-4.5 are naturally different and require sev- 
eral new ideas. However, they all need an important common basis. This base is that 
the weights a,, are concentrated on integers with the property that all prime factors of 
all ‘coordinates’ n + h; (i = 1,...,k) are at least of size N” with a small constant 7 > 0 
(depending on k). This can be expressed by Lemma 4 of [17] which we formulate here 
as 


Main Lemma. Let N® < R < N'/°* (log N), y > 0. Then we have for the weights in 
(5.12) for any £ < k and any admissible k-tuple H,, 


X  anRHpl) Ky È} a(n R, Hp £). (6.4) 
ne[N,2N) ne[N,2N) 
Ap<N",p|P a(n) 

This result makes it possible to reduce the summation in all proofs with a bounded 
value of k to numbers n where all elements of {n + ht, are almost primes. This in- 
formation provides a significant help in the proofs of Theorems 4.2—4.5. We have, for 
example, for these numbers automatically 


a(n, R, Hy £) 


logt R Se S 
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while all other n values having a small prime factor of P4,(n) can be neglected. This 
property is, in fact, crucial in the proofs of Theorems 4.2 and 4.5. 


7 Proof of Theorem 4.1 


The difference compared with the proof of Theorem B, that is Theorem 2 of [11], is that 
instead of a weight function of type 


lo R K+é BEN 
Sa K= 8, e= VK (7.1) 
og R (loglog N) 
we use a linear combination of such type weights, 
K+é 
S log R ) s L, 
P d , B(x) Y bx (7.2) 
( log R 2 
where the parameters now satisfy 
log N \>” 
k=4 (785) » Ly Ly = M = [C,K"?/6] (7.3) 
2 


where log, x denotes the v-fold iterated logarithm, c is a small constant, C, is a large 
one. 
Since the proof of Theorem 2 in [11] uses the conditions (cf. (4.1)—-(4.2) of [11]) 


cylog N 


< (log, N)?’ lbh = VK, (7.4) 


we will first check the needed changes for the proofs of the modified Theorems 4 and 
5 of [11] under the new condition: 


0,0) =M=K’'?, (7.5) 


This requires to check Sections 5-11 of [11] under the new condition (7.5). In contrast 
to Section 5 we will now choose 


M 


NA (7.6) 


0 
After a careful checking of Sections 5-10 we find that the results of Sections 6-9 and 
the most part of Section 10 remain valid and only (10.1), Lemma 9 and Section 11 need 
any revision. (We remark here two small errors in the original work [11]. First, in the a0 
line of (8.3) j has to be replaced by j’ and in (8.9) log, N has to be replaced by log, N, 
but the last inequalities of both formulae remain unchanged valid.) 
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According to the new definition of ô, in (7.6) we obtain now by (7.12) of [11] instead 
of (10.1) with the contours £4 and £’ defined in (5.3) of [11]: 


ID(s, -s)| < e©“D(0,0) for s € Lh. (7.7) 


Consequently, this implies together with Lemma 8 instead of the original Lemma 9 
the corresponding inequality 


ID(s, -s)| < ee” max{1, t D(0,0) for se £. (7.8) 


We can now turn to the necessary changes in Section 11. The new form of Lemma 9, 
the above (7.8), means that in (11.1) we have now 


ID(s, -s)| < e“D(0,0) for se L’ (7.9) 


instead of the old |D(s, -s)| <e D(0, 0) and so we can change e toe in the 
S 
jnd and a lines of (11.1) too. So we have now 


„ &-(ut+v) 


B; (s, Hi, H3) « e™ (CK? log, N)’ -pr DO 0), (7.10) 
S 


and consequently we obtain instead of (11.2) of [11] now 
C! (Hi, H) < e™ (CK? log, NY’ 8“ D(0,0). (711) 


Due to u, v = M and (7.6) we obtain in place of (11.3)—(11.4) 
eM D(0, 0)5)“*”*? (log R)! 4 ( CK? log, N ) 


(d-1)! log R (e 


Lı < 
j'=0 


eC“) D(0, 0)(log Rt! (KM* log, 1) 
(d—1)! 
D(0, 0)(log Rye CK?M"! log, N u+v+1 
(d+u+v)! log R 


< 


D(0,0)(log R)” 


aar S Ny 


and F 
D(0, 0) (log R)” = _caflog N 
L «x DOs RY i N)? eg NEN. (7.13) 
(d+u+v)! 

The final few lines of Section 11, including the crucial formula (11.5) remain there- 
fore unchanged valid. Consequently, the original Theorem 4 of [11] remains valid un- 
der the new choice (7.3) of the parameters. The same refers to Theorem 5 of [11] since 
the evaluation of the crucial integral there is obtained by the above modification of 
Sections 5-11 of [11] and no change is required in the formulation of the modified 
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Bombieri-Vinogradov type theorem (Theorem 6 of [11] in Section 12 or in the argu- 
ments of Section 13), yielding the proof of Theorem 4.5. 

Since Section 14 is independent of the others, we have to turn our attention to the 
last section, Section 15, which finishes the proof of Theorem 2 of [11]. 

The major change in Section 15 arises from the use of the weight function with the 
polynomial 


2M £ K+£ 
Por Y (a)l) ro O0- De- a 
242 


which was analyzed in the last section of [7] where we proved that (see (34) of [7]) 


Dx (K + C,K™*) - 4Ex 2 0 om 
with the notation 
T g Sg € a (2 rA ) (K/2)%*®K! K + b + A (7.16) 
a £ =M b, =M (K+ +& +N & +1 | 
a oe 
e At gI 
no SARE). om 
£ =M €,=M (K+€ +&)I\ & 
24, 248 


which corresponds to (43)—(44) in [7]. 
We would like to mention that in the argument of Soundararajan [21] he works 


K 
with a weight function of type (we defined P(n) = [](n + h,) in (5.1) 
i=1 


2 
uara (R2 )Y. aas) 


n,l ( 
Kc[1,h] \d|Pac (n) dR 
IH|=K 


In contrast to this, we used in our works [10], [11] more complicated weights of the 


form 
log(R/d) 
axa = ( y 2 mapy (EO OER )): (719) 


FCA d|Pa(n) 
[H|=K d<R 


however, with the simple polynomial P(x) = x“*’, @ = VK. (A is in our case an arbi- 


trary set of integers < N with H elements, so similar to but more general than (7.18).) 
In the present work we will use weights of type (7.19) with the polynomial (7.14). 
The different structures of the weights (the order of squaring and summation over 
k-tuples H,) actually yield different optimization problems for the polynomial P(x) 
but the difference is not significant in the most crucial case when we consider (after 
squaring in (7.19)) pairs H, H, when (H; U H3) \ (H, N H) is relatively small in size. 
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This is the reason why despite the different optimization problems we can use here 
the polynomial from [7], which is quasi-optimal for the weights of type (7.18). 

Additionally we remark that the question of optimization of weights of type (7.19) 
cannot be formulated in such a simple and nice way as in the case of the weights of 
type (7.18), analyzed in [21] and [7]. We also note that some nice functions can replace 
the polynomial P, (x) but it is hard to predict in advance what sort of properties we ex- 
actly need for the function in question so that the asymptotic evaluation of the quan- 
tities Ay and B,(h) in (5.3)-(5.6) should be possible. We just mention here that the 
case K = ,/log N, £ = K“, « > 1/2 would already cause serious problems in [11], even 
in the simple case of P(x) = x*?. 

Finally, we remark that although the optimization problems are similar at weights 
of type (7.18) and (7.19) the final result for gaps between consecutive primes would be 
much weaker in case of the use of weights of type (7.18), namely (log N)? [5+ compared 
to our present (log N)*/7**. (In case of P(x) = x** the two methods yield gaps of size 
at most (log N)/>** in case of (7.18), whereas (7.19) led to gap size of at most (log N)!” 
in [11], quoted as Theorem B here.) 

The proof of our new result follows closely the arguments of Section 15 of [11], how- 
ever the final result for the analogue of the crucial quantity S} in (15.9) (the analogue of 
Sp in (10.1) of [10]) is more complicated due to the use of the polynomial P(x) = Px(x). 
Additionally, we have to change the value of most parameters. What makes the proce- 
dure more difficult is the fact that Section 15 relies on Section 10 of [10], so its proof is 
already not self-contained in [11] either. Below we will describe the necessary changes 
compared to Section 15 of [11] (all numbers of formulae beginning with 14 or 15 refer 
to [11]). 

We do not change the values of R,0,ë and V given in (15.2) but in place of 
(15.3)-(15.5) we choose in accordance with (7.3) (with a sufficiently small constant 
c and a large enough C,) and the last section of [7] 


log N 7 1/3 M 
K= > M=|C K" /6], == 7.20 
(EX) [CK /6, =z (7.20) 


1 K? _ logR logN 


= Z = 721 
X= Toor? 10M? H 4H ve) 


> K\? 
To= be ry = K(1-99) =K-9M, 1,=K(1-79)=K-7M. (7.22) 


Under this choice we have f(r + 1) < f(r) forr > rọ and 


x N 2 2 \2M 
AUE I] h(E) (25) s) eg (7.23) 
f(r) 19<rsr, E 1-79 

analogously to (15.16); furthermore we have with C, = 2 Cig 3 from (7.21) 


25logN-M? 2 
H~ og 5 


a a KOK? log N = C, (log N)?” (log, Nn)”, (7.24) 
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This choice of parameters implies that klog, N/H will be smaller than any posi- 
tive constant if c) is chosen sufficiently small and k < 2K arbitrary. Consequently, by 
Lemma 16 of [11] we obtain for the average of the singular series 


S% (k) = = $ S% (7.25) 
HcA,|H|=k 
(cf. (14.6)-(14.7) of [11]) 
S% (k + 1) > e™ S$ (k), (7.26) 


hence in analogy with (15.18) 


S% (2K —1) s eM /40 (7.27) 
max S*,(2K - r) , 
K2r>r, 
The formulae (7.23) and (7.27) together yield for any K >r > r; 
FNSH OK =r) < FSK = 1) < FSK = eM. (7.28) 


Using the notation 0(n) = logn ifn € P and 0(n) = 0 otherwise, we can evaluate 
the crucial quantity 
K\aites 
(5) 


$ 2M 2M e e 
SEF di Die ee È 9(55) 9(44) (K+ @,)\K + @,)! (22) 


H cA H,cA &=M 8, =M 
IH, l=K |H,l=K 24€, 246, 


x E Ags Hy b)i Hy, e( Ş an+ h)- log(3N)) 
N<ns2N heA 
neA(IC, NA(Iy) 


by the aid of the modified Theorems 4 and 5 of [11] for all pairs H, H, ¢ A, |H;| = k, 
where now 


log È K+é 
ARwHey= X wn (5 = (7.30) 
d|P5¢(n) 08 
dsR 


the difference to Ap(n;H, £) in (3.6) of [11] is just in the normalization by the divi- 
sion with (log RY, Using the same argumentation as in Section 10 of [10] or, equiva- 
lently, that of Section 15 of [11] we obtain for the contribution of all pairs H,, H, with 
IH; N H,| = r instead of (15.9)—(15.10) of [11], taking into account also the different 
normalization 


K 
Sr 2 > SASAK - 1) Fey (7.31) 
r=0 
with 
2M 2M £+. 
£ £ K/2) r! (€, + € 
Fer= YY (2) (2) RO (8% 4) (732) 
EZM UM M M/ (r++ & 
24e, 248, 
k (€ + & + ICE, + & +2) K mers Es . 
(£ + 1)(€, + 1) r+€,+@,+1 log N 
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Ifr < ri = K — 7M then we will show that Fg, is positive and consequently so is 
f(r)S%, (2K — r)Fx,. The connection between Fx, Dg and Ex (see (7.15)-(7.16)) is 


5/14 
log, N 
Fx = KDx - 4Ex{ 1+ Of | —2— (7.33) 
? log N 


(This is why we asserted that the weights of type (7.18) and (7.19) lead to slightly differ- 
ent but similar optimization problems.) First we note that it is easy to see that 


3 
Fx, > 0 for rsKk(1- >), (7.34) 


since in this case the quantity in the brackets is already positive. Using the notation 
€,+1l=u,€,+1=v € (M,2M] we have, namely, in this case 


(u+v)(ut+vu-—1) K 1 1 K 
a ee a (7.35) 


1 2 4 1 
>4(1-5—) (1+ =) 2445 > 44. 
2M M M (log N)1/7 
In the crucial case aie 
KES PAS KM (7.36) 


the evaluation and estimation of F,,. is nearly exactly the same as that of Dg and Ex 
in the last section of [7] leading to (7.15). The only difference outside the brackets is the 


change 
ltl 1 €,+€, 


Tl =w to J] =i (737) 
i=1 


i=1 


but the two quantities are in case of (7.36) of the same order of magnitude, namely, 


jK = jr) (140 (+))" = j(r). (7.38) 


This means that the procedure in [7] yields for these values of r, similarly to (7.15) for 
any r with (7.36) 


€, +2. 
Sone pee 
EZM -M M M/ (r+8& +8)! 
248 248 


l (€,+€,4+ (0 +&+2) K+C,K" 


-4 i 
(€, + 1)(€+1) K+, +6 +1 | =o (7.39) 


If we show that for r < r; = K — 7M the above quantity in the brackets is smaller than 
the one appearing in the definition of Fg, in (7.32) for any pair (€,, £), then Fx, > 0 
holds, since the terms outside the brackets in (7.39) are all positive. Taking into account 
M = [C,K"?/6], it is sufficient to show 


(7.40) 


K+6M__. K 0 (eX) 
K+é,+@,+1 r+€,+@,4+1 log N ` 
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Let 
r=K-A, A>7M, m=€,+8€,+1€ (2M,4M). (7.41) 
Then we have 
K+6M K 6M(K-A — KA 
K+6M__K_ _ _6MK-A+m) -KA (742) 
K+m K-At+m (K +m)(K —-A+m) 
6MK —- 7MK M 1 
< ——— = -— < -——_, 
2K2 2K (log N)2/7 


which thereby proves (7.40) and so settles the case r < r}. 

The analysis of the last section of [7] teaches us that the maximal possible order 
of magnitude of |F,,.| does not change significantly for r € [r,, K], so due to the strong 
decrease of the additional factor f(r) we have in view of (7.28) in fact a similar inequal- 
ity to (7.28) if we multiply all terms with the corresponding Fx, namely for r > r; we 
have, say 

Ff (182K =r) < Fy, fS AK -= roe. (7.43) 


This shows (cf. (7.29)—(7.32)) 
Sg > 0. (7.44) 


This means that for some n € (N, 2N] we have 


$ O(n + h) - log(3N) > 0 (7.45) 
heA 
and so we have h,,h, € Awithn+h,,n +h, € F, therefore two primes p’, p” with 
p'- p" =h -h, € A- A. Inthe general formulation of Theorem 4.2, A ¢ [1, N] NN 
was any sequence with H elements, where 


H > H, = C(log N)” (log, N)”. (7.46) 


In particular we can take A = [1, H] yielding two primes p’, p” with0 < p” — p' < Hy. 


Added in proof for (06/30/2013 and 10/30/2013). Very recently Yitang Zhang (Bounded 
Gaps between Primes, Ann. of Math., to appear) succeeded in showing by an essential 
refinement of the methods sketched in the present paper and by an additional use 
of the fact that only smooth numbers have a significant effect for the sieving process 
(Y. Motohashi and J. Pintz, A Smoothed GPY sieve, Bull. London Math. Soc. 40 (2008) 
no. 2, 298-310) that bounded gaps between primes occur infinitely often. The original 
bound 70 million is now diminished slightly below 5000 by the Polymath 8 project of 
T. Tao. 
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